PUBLISHED IN IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, MAY 2010 (PRINT AVAILABLE @ HTTP://DX.DOI.ORG/10.1109/TWC.2010.05.090441)



Turbo Packet Combining for Broadband 
Space-Time BICM Hybrid-ARQ Systems with 

Co-Channel Interference 

Tarik Ait-Idir, Member, IEEE, Houda Chafnaji, and Samir Saoudi, Senior Member, IEEE 






Abstract — In this paper, efficient turbo packet combining for 
single carrier (SC) broadband multiple-input-multiple-output 
(MIMO) hybrid-automatic repeat request (ARQ) transmission 
with unknown co-channel interference (CCI) is studied. We pro- 
pose a new frequency domain soft minimum mean square error 
(MMSE)-based signal level combining technique where received 
signals and channel frequency responses (CFR)s corresponding 
to all retransmissions are used to decode the data packet. We 
provide a recursive implementation algorithm for the introduced 
scheme, and show that both its computational complexity and 
memory requirements are quite insensitive to the ARQ delay, 
i.e., maximum number of ARQ rounds. Furthermore, we analyze 
the asymptotic performance, and show that under a sum-rank 
condition on the CCI MIMO ARQ channel, the proposed packet 
combining scheme is not interference-limited. Simulation results 
are provided to demonstrate the gains offered by the proposed 
technique. 

Index Terms — Automatic repeat request (ARQ) mechanisms, 
multiple-input-multiple-output (MIMO), single carrier (SC), un- 
known co-channel interference (CCI), intersymbol interference 
(ISI), frequency domain methods. 



I. Introduction 

SPACE-TIME-BIT-INTERLEAVED coded modulation 
(ST-BICM) with iterative decoding is an attractive signal- 
ing scheme that offers high spectral efficiencies over multiple- 
input-multiple-output (MIMO)-intersymbol interference (ISI) 
channels [1], [2], [3], [4], [5]. To combat ISI in single carrier
(SC) broadband ST-BICM transmission, frequency domain 
equalization, initially introduced for single antenna systems 
[6], [7], [8], [9], has been proposed using iterative (turbo)
processing [10]. It is a receiver scheme that allows high
ISI cancellation capability at an affordable complexity cost. 
In practical systems, unknown co-channel interference (CCI) 
caused by other transmitters (distant users and/or neighboring 
cells) who simultaneously use the same radio resource can 
dramatically degrade the link performance. This limitation can 
be overcome by using the so-called hybrid-automatic repeat 
request (ARQ) protocols, where channel coding is combined
with ARQ [11], [12]. In hybrid-ARQ, erroneous data packets
are kept in the receiver and used to detect/decode the
retransmitted frame [13], [14], [15], [16], [17], [18], [19]. This
technique is often referred to as "packet combining". Practical
packet combining schemes have been addressed in [20]. In
[21], an elegant information-theoretic framework has been
introduced to analyze the throughput and delay of hybrid-ARQ
under random user behavior. Interestingly, the authors
have shown that hybrid-ARQ systems are not interference 
limited, i.e., arbitrarily high throughput can be achieved by 
simply increasing the transmit power of all users even when 
multi-user detection (MUD) techniques are not used at the 
receiver. Motivated by the above considerations, we investigate
efficient low-complexity turbo frequency domain reception 
techniques for SC broadband ST-BICM signaling with hybrid- 
ARQ operating over CCI-limited MIMO channels. 

The powerful diversity-multiplexing tradeoff tool, initially
introduced by Zheng and Tse for coherent delay-limited,
i.e., quasi-static, MIMO channels [22], has been elegantly
extended by El Gamal et al. to MIMO ARQ channels with
flat fading, and referred to as diversity-multiplexing-delay
tradeoff [23]. The authors have proved that the ARQ delay,
i.e., maximum number of ARQ protocol rounds, improves the
outage probability^1 performance for large classes of MIMO
ARQ channels [23]. In particular, they have demonstrated that
the diversity order can be increased due to ARQ even when
the MIMO ARQ channel is long-term static, i.e., the MIMO
channel is random but fixed for all ARQ rounds. The
diversity-multiplexing-delay tradeoff has then been characterized in
the case of block-fading MIMO ARQ channels, i.e., multiple
fading blocks are allowed within the same ARQ round [25].
In [26], the outage probability of MIMO-ISI ARQ channels
has been evaluated under the assumptions of short-term static
channel dynamic^2 and Chase-type ARQ [27], i.e., the data
packet is entirely retransmitted. It has been shown that, as
in the flat fading case, ARQ presents an important source of
diversity, but its influence becomes only minimal when the
ARQ delay is increased. This observation suggests that the
design of practical packet combining schemes should target
a high diversity order for early ARQ rounds. Supplementary
retransmissions are then used to correct rare erroneous data
packets, when they occur.

^1 In non-ergodic, i.e., block fading quasi-static channels, the outage
probability is a meaningful measure that provides a lower bound on the block
error probability. It is defined as the probability that the mutual information,
as a function of the channel realization and the average signal-to-noise ratio
(SNR), is below the transmission rate [24].

^2 In the case of short-term static dynamic, the ARQ channel realizations
are independent from round to round. This dynamic applies to slow ARQ
protocols where the delay between two rounds is larger than the channel
coherence time.

More recently, packet combining for MIMO ARQ systems
has been investigated (e.g., [28], [29], [30], [31], [32],
[33], [34], [35], [36]). Turbo combining techniques, where
decoding is iteratively performed through the exchange of
soft information between the soft-input-soft-output (SISO)
packet combiner and the SISO decoder, have been proposed
for the MIMO-ISI ARQ channel using unconditional minimum
mean square error (MMSE)-aided combining [37], [26]. These
approaches have then been extended to broadband MIMO code
division multiple access (CDMA) systems with ARQ [38].
Time domain turbo packet combining for CCI-limited
MIMO-ISI ARQ channels has been introduced in [39].

In this paper, we investigate efficient turbo receiver tech- 
niques for SC ST-BICM transmission with Chase-type ARQ 
over broadband MIMO channel with unknown CCI. We in- 
troduce a frequency domain MMSE-based turbo packet com- 
bining scheme, where all ARQ rounds are used to decode the 
data packet. By using an identical cyclic prefix (CP) word 
for multiple retransmissions of a symbol block, we perform 
transmission combining at the signal level. The frequency 
domain soft MMSE packet combiner performs soft ISI can- 
cellation and retransmission combining in the presence of 
unknown CCI jointly over all received signal blocks. We also 
provide an efficient recursive implementation algorithm for the 
proposed scheme, and show that both the computational load 
and memory requirements are quite insensitive to the ARQ 
delay. The complexity order is only cubic in terms of the 
number of transmit antennas. Received signals and channel 
frequency responses (CFR)s corresponding to all ARQ rounds 
are used without having to be stored in the receiver. We
analyze the asymptotic performance of the proposed combin- 
ing scheme. Interestingly, we show that under a rank-condition 
on the MIMO ARQ channel corresponding to unknown CCI, 
the proposed combining scheme is not interference-limited, 
i.e., unknown CCI can be completely removed. Finally, we 
provide numerical simulation results for some scenarios to 
validate our findings. 

The remainder of the paper is organized as follows. In 
Section II, we describe the ARQ system under consideration,
along with the communication model in the presence of
unknown CCI. In Section III, we present the frequency domain
turbo packet combining scheme we propose in this paper,
and analyze both its complexity and memory requirements. In
Section IV, we carry out the asymptotic performance analysis,
and provide representative numerical results that demonstrate
the gains achieved by the proposed scheme. Finally, we point
out conclusions in Section V.

Notation: 

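• Superscripts $^*$, $^\top$, and $^H$ denote conjugate, transpose, and
Hermitian transpose, respectively. $E[\cdot]$ is the mathematical
expectation of the argument $(\cdot)$.

• Let $\mathbf{X}$ be a square matrix, $\mathrm{diag}\{\mathbf{X}\}$ denotes the row
vector corresponding to the diagonal of $\mathbf{X}$, and $\mathrm{tr}\{\mathbf{X}\}$
denotes the trace of $\mathbf{X}$. When $\mathbf{X}_1, \dots, \mathbf{X}_M \in \mathbb{C}^{N \times Q}$,
$\mathrm{diag}\{\mathbf{X}_1, \dots, \mathbf{X}_M\}$ denotes the $MN \times MQ$ matrix
whose diagonal blocks are $\mathbf{X}_1, \dots, \mathbf{X}_M$. $\mathrm{diag}\{\mathbf{x}\}$ is the
$N \times N$ diagonal matrix whose diagonal entries are the
elements of the complex vector $\mathbf{x} \in \mathbb{C}^{N}$. $(\mathbf{X})_{m,m}$ denotes
the $m$th diagonal entry of matrix $\mathbf{X}$.

• $\mathbf{I}_N$ is the $N \times N$ identity matrix, and $\mathbf{0}_{N \times Q}$ denotes an
all zero $N \times Q$ matrix. For $i = 0, \dots, T-1$, $\mathbf{E}_{i,N}$ is an
$N \times NT$ zero matrix where the $i$th $N \times N$ block is equal
to $\mathbf{I}_N$.

• Operator $\otimes$ denotes the Kronecker product, and $\delta_{m,n}$ is
the Kronecker symbol, i.e., $\delta_{m,n} = 1$ for $m = n$ and
$\delta_{m,n} = 0$ for $m \neq n$.

• For each sequence of matrices $\mathbf{X}_0, \dots, \mathbf{X}_{T-1}$ (respectively,
scalars $x_0, \dots, x_{T-1}$), $\bar{\mathbf{X}} = \frac{1}{T}\sum_{i=0}^{T-1}\mathbf{X}_i$ denotes
its time average (respectively, $\bar{x} = \frac{1}{T}\sum_{i=0}^{T-1}x_i$).

• $\mathbf{U}_T$ is a $T \times T$ unitary matrix whose $(m, n)$th element is
$(\mathbf{U}_T)_{m,n} = \frac{1}{\sqrt{T}}\exp\left\{-j\frac{2\pi mn}{T}\right\}$ for $m, n = 0, \dots, T-1$,
where $j = \sqrt{-1}$. $\mathbf{U}_{T,N}$ is $TN \times TN$ defined as
$\mathbf{U}_{T,N} = \mathbf{U}_T \otimes \mathbf{I}_N$.

• For each vector $\mathbf{x} \in \mathbb{C}^{TN}$, $\mathbf{x}_f$ denotes the discrete Fourier
Transform (DFT) of $\mathbf{x}$, i.e., $\mathbf{x}_f = \mathbf{U}_{T,N}\,\mathbf{x}$.

• The acronym i.i.d. means "independent and identically
distributed".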

II. ARQ System Model 

A. SC-MIMO ARQ Transmission Scheme 

We consider an SC multi-antenna-aided transmission
scheme where the transmitter and the receiver are equipped
with $N_T$ transmit (index $t = 1, \dots, N_T$) and $N_R$ receive
(index $r = 1, \dots, N_R$) antennas, respectively. The MIMO
channel is frequency selective and is composed of $L$
symbol-spaced taps (index $l = 0, \dots, L-1$). The energy of each tap
$l$ is denoted $\sigma_l^2$, and the total energy is normalized to one, i.e.,
$\sum_{l=0}^{L-1}\sigma_l^2 = 1$.

Each information block is initially encoded then interleaved
with the aid of a semi-random interleaver $\Pi$. The resulting
frame is serial to parallel converted and mapped over the
elements of the constellation set $\mathcal{S}$ to produce symbol matrix
$\mathbf{S} \in \mathcal{S}^{N_T \times T}$, where $T$ is the number of channel uses (c.u.). A
CP word, whose length is $T_{CP} \geq L - 1$, is then appended
to $\mathbf{S}$, thereby yielding matrix $\mathbf{S}' \in \mathcal{S}^{N_T \times (T + T_{CP})}$. This
allows the prevention of inter-block interference (IBI) and the
exploitation of the multipath diversity of the MIMO broadband
channel. We suppose that no channel state information (CSI)
is available at the transmitter and assume infinitely deep
interleaving. Therefore, transmitted symbols are independent
and have equal transmit power, i.e.,

$$E\left[s_{t,i}\, s^{*}_{t',i'}\right] = \delta_{t,t'}\,\delta_{i,i'}. \tag{1}$$

At the upper layer, an ARQ protocol is used to help correct 
erroneous frames. An acknowledgment message is generated 
after the decoding of each information block. Therefore, when 
the decoding is successful, the receiver sends back a positive 
acknowledgment (ACK) to the transmitter, while the feedback 
of a negative acknowledgment (NACK) indicates that the
decoding outcome is erroneous. Let $K$ denote the ARQ delay,
and $k = 1, \dots, K$ denote the ARQ round index. When the
transmitter receives an ACK feedback, it stops the transmission
of the current block and moves on to the next information
block. Reception of a NACK message incurs supplementary
ARQ rounds until the packet is correctly decoded or the ARQ
delay $K$ is reached. We focus on Chase-type ARQ, i.e., the
symbol matrix $\mathbf{S}'$ is completely retransmitted. In addition, we
suppose perfect packet error detection, and assume that the
one bit ACK/NACK feedback is error-free.

Figure 1. SC-MIMO ARQ communication scheme at ARQ round k.
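The ACK/NACK logic described above can be sketched in a few lines of Python. This is only an illustrative sketch of the protocol loop, not the authors' simulator: the function name `run_chase_arq` and the callback `decode_ok` (which stands in for "retransmit the whole block, combine, decode, and check for errors at round k") are hypothetical names introduced here.

```python
def run_chase_arq(decode_ok, max_rounds):
    """Run one information block through a Chase-type ARQ protocol.

    decode_ok(k) is a hypothetical callback returning True when the packet
    is correctly decoded at ARQ round k (perfect error detection assumed).
    max_rounds plays the role of the ARQ delay K.
    Returns (number of rounds used, final feedback message).
    """
    for k in range(1, max_rounds + 1):
        # At every round the same symbol block is retransmitted in full.
        if decode_ok(k):
            return k, "ACK"    # transmitter moves on to the next block
        # Otherwise a NACK is fed back and another round is triggered.
    return max_rounds, "NACK"  # ARQ delay reached, packet still in error


# Example: a packet that only becomes decodable at the third round.
rounds_used, feedback = run_chase_arq(lambda k: k >= 3, max_rounds=4)
```

In this toy run the block is acknowledged at round 3; a packet that never decodes ends with a NACK after K rounds.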



B. Communication Model in the Presence of Unknown CCI 

The broadband MIMO ARQ channel is assumed to be short- 
term static fading, i.e., the channel independently changes 
from round to round. Note that this channel dynamic applies to 
slow ARQ protocols where the delay between two consecutive 
ARQ rounds is larger than the channel coherence time. It also 
applies to orthogonal frequency division multiplexing (OFDM) 
systems where frequency hopping is used to mitigate ISI. 



Let $\mathbf{H}_0^{(k)}, \dots, \mathbf{H}_{L-1}^{(k)} \in \mathbb{C}^{N_R \times N_T}$ denote channel matrices
at the $k$th ARQ round, and whose entries are i.i.d. zero-mean
circularly symmetric Gaussian, i.e., $h_{r,t,l}^{(k)} \sim \mathcal{CN}(0, \sigma_l^2)$,
where $h_{r,t,l}^{(k)}$ denotes the fading channel corresponding to path
$l$ and connecting the $t$th transmit and the $r$th receive antennas
at the $k$th ARQ round. Therefore, the channel energy at each
receive antenna $r$ is

$$\sum_{l=0}^{L-1}\sum_{t=1}^{N_T} E\left[\left|h_{r,t,l}^{(k)}\right|^2\right] = N_T. \tag{2}$$



The channel profile, i.e., power distribution $\sigma_0^2, \dots, \sigma_{L-1}^2$ and
number of taps $L$, is supposed to be identical for at least $K$
consecutive rounds. This is a reasonable assumption because 
the channel profile dynamic mainly depends on the shadowing 
effect. 

Transmitted data blocks are corrupted by an unknown CCI
signal caused by a co-channel transmission that uses $N'_T$
transmit antennas (index $t' = 1, \dots, N'_T$) and $T$ c.u. The
link between the interferer transmitter and the receiver is
composed of $L'$ taps, where the channel matrix of each tap
$l' = 0, \dots, L'-1$ at round $k$ is $\mathbf{H}_{l'}^{CCI(k)} \in \mathbb{C}^{N_R \times N'_T}$, and
its energy is $\sigma'^2_{l'} \geq 0$.^3 We suppose that the receiver has no
knowledge either about the interferer CSI or about its channel
profile and number of transmit antennas (i.e., parameters $N'_T$,
$L'$, $\mathbf{H}_{l'}^{CCI(k)}$, and $\sigma'^2_{l'}$, $\forall l'$, are completely unknown at the
receiver). As the desired user, the interferer employs a
CP-aided transmission strategy. Its transmitted symbols $s_{t',i}^{CCI(k)}$ at
each round $k$ verify the independence/energy-normalization
condition (1), as do the useful symbols. Therefore, the
signal-to-interference ratio (SIR) at each receive antenna is given as

$$\mathrm{SIR} = \frac{N_T}{N'_T \sum_{l'=0}^{L'-1} \sigma'^2_{l'}}. \tag{3}$$



We assume perfect frame synchronization between the
interferer and the desired user. They can differ in terms of
the CP word length, which depends on the delay of the
multipath channel, but are synchronized in terms of the useful
symbol frames. Under this assumption, CP deletion yields the
following baseband received $N_R \times 1$ signal at round $k$ and
channel use $i$,

$$\mathbf{y}_i^{(k)} = \sum_{l=0}^{L-1}\mathbf{H}_l^{(k)}\,\mathbf{s}_{(i-l)\bmod T} + \underbrace{\sum_{l'=0}^{L'-1}\mathbf{H}_{l'}^{CCI(k)}\,\mathbf{s}^{CCI(k)}_{(i-l')\bmod T} + \mathbf{n}_i^{(k)}}_{\mathbf{w}_i^{(k)}\,:\ \text{CCI+noise}}, \tag{4}$$

where $\mathbf{n}_i^{(k)} \sim \mathcal{CN}\left(\mathbf{0}_{N_R \times 1}, \sigma^2\mathbf{I}_{N_R}\right)$ denotes the receiver
thermal noise. The SC-MIMO ARQ communication scheme
at round $k$ is depicted in Fig. 1. In the following, we assume
perfect channel estimation at each ARQ round $k$ (i.e., $\mathbf{H}_l^{(k)}$,
$\forall l$, are perfectly known) while CCI channel matrices $\mathbf{H}_{l'}^{CCI(k)}$,
$\forall l', k$, are completely unknown at the receiver side.

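1) Single-Round Communication Model: To derive the
block communication model corresponding to ARQ round $k$,
we consider the following block signal vector,

$$\mathbf{y}^{(k)} \triangleq \left[\mathbf{y}_0^{(k)\top}, \dots, \mathbf{y}_{T-1}^{(k)\top}\right]^{\top} \in \mathbb{C}^{T N_R}, \tag{5}$$

that groups signals corresponding to the entire symbol frame.
Vector $\mathbf{y}^{(k)}$ can be expressed as,

$$\mathbf{y}^{(k)} = \mathbf{H}^{(k)}\mathbf{s} + \mathbf{w}^{(k)}, \tag{6}$$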



^3 The ARQ processes corresponding to the desired user and the interferer
are not necessarily synchronized. Therefore, the round index $k$ appearing in
the CCI channel matrices only refers to the index of a realization of the
interferer channel at ARQ round $k$. The same remark holds for CCI symbols
in (4). Also, note that $\sum_{l'=0}^{L'-1}\sigma'^2_{l'} \neq 1$ in order to account for the path-loss
between the interferer and the receiver.
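where

$$\mathbf{s} \triangleq \left[\mathbf{s}_0^{\top}, \dots, \mathbf{s}_{T-1}^{\top}\right]^{\top} \in \mathcal{S}^{T N_T}, \tag{7}$$

$$\mathbf{w}^{(k)} \triangleq \left[\mathbf{w}_0^{(k)\top}, \dots, \mathbf{w}_{T-1}^{(k)\top}\right]^{\top} \in \mathbb{C}^{T N_R}, \tag{8}$$

and

$$\mathbf{H}^{(k)} \triangleq \begin{bmatrix}
\mathbf{H}_0^{(k)} & \mathbf{0} & \cdots & \mathbf{0} & \mathbf{H}_{L-1}^{(k)} & \cdots & \mathbf{H}_1^{(k)} \\
\mathbf{H}_1^{(k)} & \mathbf{H}_0^{(k)} & \ddots & & & \ddots & \vdots \\
\vdots & \ddots & \ddots & \ddots & & & \mathbf{H}_{L-1}^{(k)} \\
\mathbf{H}_{L-1}^{(k)} & & \ddots & \ddots & \ddots & & \mathbf{0} \\
\mathbf{0} & \ddots & & \ddots & \ddots & \ddots & \vdots \\
\vdots & \ddots & \ddots & & \ddots & \ddots & \mathbf{0} \\
\mathbf{0} & \cdots & \mathbf{0} & \mathbf{H}_{L-1}^{(k)} & \cdots & \mathbf{H}_1^{(k)} & \mathbf{H}_0^{(k)}
\end{bmatrix} \in \mathbb{C}^{T N_R \times T N_T} \tag{9}$$

is a block circulant matrix (each $\mathbf{0}$ is $\mathbf{0}_{N_R \times N_T}$) that can be block diagonalized in
a Fourier basis as

$$\mathbf{H}^{(k)} = \mathbf{U}_{T,N_R}^{H}\,\boldsymbol{\Lambda}^{(k)}\,\mathbf{U}_{T,N_T}, \tag{10}$$

where

$$\boldsymbol{\Lambda}^{(k)} \triangleq \mathrm{diag}\left\{\boldsymbol{\Lambda}_0^{(k)}, \dots, \boldsymbol{\Lambda}_{T-1}^{(k)}\right\} \in \mathbb{C}^{T N_R \times T N_T}. \tag{11}$$

Exploiting (10) and the block circulant structure of $\mathbf{H}^{(k)}$, we
get

$$\boldsymbol{\Lambda}_i^{(k)} = \sum_{l=0}^{L-1}\mathbf{H}_l^{(k)}\exp\left\{-j\frac{2\pi i l}{T}\right\}. \tag{12}$$

Applying the DFT $\mathbf{U}_{T,N_R}$ to signal vector $\mathbf{y}^{(k)}$ yields the
single-round frequency domain communication model

$$\mathbf{y}_f^{(k)} = \boldsymbol{\Lambda}^{(k)}\mathbf{s}_f + \mathbf{w}_f^{(k)}, \tag{13}$$

where $\mathbf{y}_f^{(k)}$, $\mathbf{s}_f$, and $\mathbf{w}_f^{(k)}$ denote the DFT of $\mathbf{y}^{(k)}$, $\mathbf{s}$, and $\mathbf{w}^{(k)}$,
respectively.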
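To make the block diagonalization concrete, the following sketch checks the single-antenna special case ($N_T = N_R = 1$, no CCI or noise): a circular convolution with the channel taps becomes, after a unitary DFT, a per-bin multiplication by the channel frequency response. The tap values and symbols below are arbitrary illustrative numbers, and the helper names (`dft`, `circular_channel`, `channel_frequency_response`) are hypothetical.

```python
import cmath


def dft(x):
    """Unitary DFT with entries exp(-j*2*pi*m*n/T) / sqrt(T)."""
    T = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * cmath.pi * m * n / T) for n in range(T)) / T ** 0.5
        for m in range(T)
    ]


def circular_channel(h, s):
    """y_i = sum_l h_l * s_{(i - l) mod T}, i.e. the signal left after CP deletion."""
    T = len(s)
    return [sum(h[l] * s[(i - l) % T] for l in range(len(h))) for i in range(T)]


def channel_frequency_response(h, T):
    """lambda_i = sum_l h_l * exp(-j*2*pi*i*l/T), the scalar counterpart of the CFR."""
    return [
        sum(h[l] * cmath.exp(-2j * cmath.pi * i * l / T) for l in range(len(h)))
        for i in range(T)
    ]


h = [0.8, 0.5 - 0.2j, 0.1j]            # L = 3 illustrative taps
s = [1, -1, 1j, -1j, 1, 1, -1, 1j]     # T = 8 illustrative symbols

y_f = dft(circular_channel(h, s))
s_f = dft(s)
lam = channel_frequency_response(h, len(s))

# Largest deviation between DFT(y) and lambda_i * DFT(s)_i over all bins.
max_err = max(abs(y_f[i] - lam[i] * s_f[i]) for i in range(len(s)))
```

Up to floating-point rounding, `max_err` is zero, which is the scalar version of the frequency domain model: each frequency bin sees a flat channel.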

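2) Multi-Round Communication Model: Let us suppose
that received signals and channel matrices corresponding to
ARQ rounds $1, \dots, k$ are available at the receiver. First, we
introduce the signal vector notation

$$\underline{\mathbf{y}}_i^{(k)} \triangleq \left[\mathbf{y}_i^{(1)\top}, \dots, \mathbf{y}_i^{(k)\top}\right]^{\top} \in \mathbb{C}^{k N_R}, \tag{14}$$

where received signals corresponding to multiple ARQ rounds
are grouped in such a way to construct $kN_R$ virtual receive
antennas. Similarly, we define,

$$\underline{\mathbf{H}}_l^{(k)} \triangleq \left[\mathbf{H}_l^{(1)\top}, \dots, \mathbf{H}_l^{(k)\top}\right]^{\top} \in \mathbb{C}^{k N_R \times N_T}, \tag{15}$$

$$\underline{\mathbf{w}}_i^{(k)} \triangleq \left[\mathbf{w}_i^{(1)\top}, \dots, \mathbf{w}_i^{(k)\top}\right]^{\top} \in \mathbb{C}^{k N_R}. \tag{16}$$

The block signal vector that serves for jointly performing,
at ARQ round $k$, packet combining and equalization in the
presence of CCI is constructed similarly to (5),

$$\underline{\mathbf{y}}^{(k)} \triangleq \left[\underline{\mathbf{y}}_0^{(k)\top}, \dots, \underline{\mathbf{y}}_{T-1}^{(k)\top}\right]^{\top} \in \mathbb{C}^{k N_R T}, \tag{17}$$

and can be expressed as,

$$\underline{\mathbf{y}}^{(k)} = \underline{\mathbf{H}}^{(k)}\mathbf{s} + \underline{\mathbf{w}}^{(k)}, \tag{18}$$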



Figure 2. Block diagram of the receiver scheme at ARQ round k.



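where

$$\underline{\mathbf{w}}^{(k)} \triangleq \left[\underline{\mathbf{w}}_0^{(k)\top}, \dots, \underline{\mathbf{w}}_{T-1}^{(k)\top}\right]^{\top} \in \mathbb{C}^{k N_R T}. \tag{19}$$

Matrix $\underline{\mathbf{H}}^{(k)}$ has the same structure as (9), where its first
$TkN_R \times N_T$ block column is equal to

$$\left[\underline{\mathbf{H}}_0^{(k)\top}, \dots, \underline{\mathbf{H}}_{L-1}^{(k)\top}, \mathbf{0}_{N_T \times (T-L)kN_R}\right]^{\top}. \tag{20}$$

$\underline{\mathbf{H}}^{(k)}$ can be factorized similarly to (10) as,

$$\underline{\mathbf{H}}^{(k)} = \mathbf{U}_{T,kN_R}^{H}\,\underline{\boldsymbol{\Lambda}}^{(k)}\,\mathbf{U}_{T,N_T}, \tag{21}$$

where

$$\underline{\boldsymbol{\Lambda}}^{(k)} \triangleq \mathrm{diag}\left\{\underline{\boldsymbol{\Lambda}}_0^{(k)}, \dots, \underline{\boldsymbol{\Lambda}}_{T-1}^{(k)}\right\} \in \mathbb{C}^{k N_R T \times N_T T}, \tag{22}$$

$$\underline{\boldsymbol{\Lambda}}_i^{(k)} \triangleq \left[\boldsymbol{\Lambda}_i^{(1)\top}, \dots, \boldsymbol{\Lambda}_i^{(k)\top}\right]^{\top} \in \mathbb{C}^{k N_R \times N_T}, \tag{23}$$

and matrices $\boldsymbol{\Lambda}_i^{(k')}$, $k' = 1, \dots, k$, are given by (12). The
multi-round frequency domain communication model at ARQ
round $k$ is then expressed as,

$$\underline{\mathbf{y}}_f^{(k)} = \underline{\boldsymbol{\Lambda}}^{(k)}\mathbf{s}_f + \underline{\mathbf{w}}_f^{(k)}, \tag{24}$$

where $\underline{\mathbf{y}}_f^{(k)}$ and $\underline{\mathbf{w}}_f^{(k)}$ denote the DFT of $\underline{\mathbf{y}}^{(k)}$ and $\underline{\mathbf{w}}^{(k)}$,
respectively.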

III. Frequency Domain Turbo Packet Combining in
the Presence of Unknown CCI

A. General Description 

At each ARQ round, the decoding of a data packet is 
performed by iteratively exchanging soft information in the 
form of log-likelihood ratio (LLR) values between the soft 
packet combiner, i.e., the joint transmission combining and 
equalization unit, and the soft-input-soft-output (SISO)
decoder. Let us suppose that, at ARQ round $k$, all received
signals and channel matrices corresponding to previous rounds
$k-1, \dots, 1$ are available at the receiver. Note that this
assumption may not be feasible in practice, since the receiver
would require a huge memory. In Subsection III-D, we show
that the proposed turbo packet combining algorithm requires
little memory while it uses signals and CSIs corresponding to
all ARQ rounds $1, \dots, k$. The block diagram of the frequency
domain turbo packet combining receiver at ARQ round $k$ is
depicted in Fig. 2.

First, the multiple ARQ rounds frequency domain block
signal vector $\underline{\mathbf{y}}_f^{(k)} = \mathbf{U}_{T,kN_R}\,\underline{\mathbf{y}}^{(k)}$ and CFR $\underline{\boldsymbol{\Lambda}}^{(k)}$ are
constructed. Second, the soft packet combiner estimates the
covariance of unknown CCI plus noise, then computes the
multi-transmission MMSE filter that takes into account both
co-antenna interference (CAI) and ISI while suppressing unknown
CCI. These two elements are then used with a priori
information to compute extrinsic LLRs corresponding to coded and
interleaved bits. The generated soft information is transferred
to the SISO decoder to compute a posteriori LLRs about both
coded and useful bits. Only extrinsic information is fed back to
the soft packet combiner to help perform transmission
combining and equalization in the next turbo iteration. The iterative
soft packet combining and decoding process is stopped after a
preset number of turbo iterations and a decision about the data
packet is made. The ACK/NACK message is then sent
back to the transmitter depending on the decoding outcome.
Note that during the first iteration a priori LLR values are the
output of the SISO decoder obtained at the last iteration of
previous round $k-1$.

B. Properties of CCI plus Noise Covariance 

In this subsection, we focus on covariance properties of CCI
plus noise present in both the single-round and multi-round
communication models given by (6) and (18), respectively.
These properties present an important ingredient in the turbo
packet combining algorithm we introduce in Subsection III-C.

Let $\boldsymbol{\Theta}_k$ denote the covariance of CCI plus noise $\mathbf{w}_i^{(k)}$
present in received signal (4) at round $k$,

$$\boldsymbol{\Theta}_k \triangleq E\left[\mathbf{w}_i^{(k)}\mathbf{w}_i^{(k)H}\right] \in \mathbb{C}^{N_R \times N_R}. \tag{25}$$

Let us group covariance matrices corresponding to rounds
$1, \dots, k$ in the block diagonal matrix

$$\boldsymbol{\Xi}_k \triangleq \mathrm{diag}\left\{\boldsymbol{\Theta}_1, \dots, \boldsymbol{\Theta}_k\right\} \in \mathbb{C}^{k N_R \times k N_R}. \tag{26}$$

Proposition 1: The covariance $\underline{\boldsymbol{\Xi}}_k \triangleq E\left[\underline{\mathbf{w}}^{(k)}\underline{\mathbf{w}}^{(k)H}\right]$ of the
CCI plus noise block vector $\underline{\mathbf{w}}^{(k)}$ present in the multi-round
communication model (18) after $k$ rounds is expressed as

$$\underline{\boldsymbol{\Xi}}_k = \mathbf{I}_T \otimes \boldsymbol{\Xi}_k \in \mathbb{C}^{T k N_R \times T k N_R}. \tag{27}$$

Proof: The expression in (27) is easily obtained by
calculating the mathematical expectation of $\underline{\mathbf{w}}^{(k)}\underline{\mathbf{w}}^{(k)H}$. In
the derivation, we only exploit the independence between
the entries of $\mathbf{H}_{l}^{CCI(k)}$ and $\mathbf{H}_{l'}^{CCI(k')}$, $\forall l, l', k$, and $k'$ (i.e.,
short-term static block fading dynamic of the CCI MIMO
ARQ channel), and the fact that CCI symbols satisfy (1).
No assumption on the structure of the CCI block matrix is
used. A detailed proof of (27) in the case of sliding-window
aided time-domain detection can be found in [39, Subsection
III.C]. ■

Proposition 2: The covariance $\underline{\boldsymbol{\Theta}}_k \triangleq E\left[\mathbf{w}^{(k)}\mathbf{w}^{(k)H}\right]$ of
the single-round CCI plus noise block vector $\mathbf{w}^{(k)}$ at ARQ
round $k$ is

$$\underline{\boldsymbol{\Theta}}_k = \mathbf{I}_T \otimes \boldsymbol{\Theta}_k. \tag{28}$$

Proof: The proof follows by simply invoking Proposition
1 for one round. ■

Proposition 3: Covariance matrices of frequency domain
CCI plus noise vectors $\underline{\mathbf{w}}_f^{(k)}$ and $\mathbf{w}_f^{(k)}$ (corresponding to
the DFTs of $\underline{\mathbf{w}}^{(k)}$ and $\mathbf{w}^{(k)}$, respectively) are $\underline{\boldsymbol{\Xi}}_k$ and $\underline{\boldsymbol{\Theta}}_k$,
respectively.

Proof: The proof of Proposition 3 follows from the
fact that $\underline{\boldsymbol{\Xi}}_k$ and $\underline{\boldsymbol{\Theta}}_k$ are block circulant and block diagonal
matrices. ■

Proposition 1 indicates that the covariance of the multi-round
CCI plus noise vector can be obtained by separately
computing single-round covariances using Proposition 2. This
result greatly impacts the computational complexity of the
proposed algorithm, as will be shown in Subsection III-D.
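As a short worked step (a sketch, not taken from the paper's proof), the frequency domain invariance stated in Proposition 3 can be checked for the single-round case with the mixed-product property of the Kronecker product. Here $\boldsymbol{\Theta}_k$ denotes the $N_R \times N_R$ single-round CCI plus noise covariance, $\mathbf{w}_f^{(k)} = \mathbf{U}_{T,N_R}\mathbf{w}^{(k)}$, and the time domain block covariance is assumed to be $\mathbf{I}_T \otimes \boldsymbol{\Theta}_k$ as in Proposition 2:

```latex
\begin{aligned}
E\!\left[\mathbf{w}_f^{(k)}\mathbf{w}_f^{(k)H}\right]
  &= \mathbf{U}_{T,N_R}\left(\mathbf{I}_T \otimes \boldsymbol{\Theta}_k\right)\mathbf{U}_{T,N_R}^{H} \\
  &= \left(\mathbf{U}_T \otimes \mathbf{I}_{N_R}\right)
     \left(\mathbf{I}_T \otimes \boldsymbol{\Theta}_k\right)
     \left(\mathbf{U}_T^{H} \otimes \mathbf{I}_{N_R}\right) \\
  &= \left(\mathbf{U}_T\,\mathbf{I}_T\,\mathbf{U}_T^{H}\right) \otimes
     \left(\mathbf{I}_{N_R}\,\boldsymbol{\Theta}_k\,\mathbf{I}_{N_R}\right) \\
  &= \mathbf{I}_T \otimes \boldsymbol{\Theta}_k .
\end{aligned}
```

The same steps apply to the multi-round vector with $N_R$ replaced by $kN_R$ and $\boldsymbol{\Theta}_k$ replaced by the block diagonal matrix of single-round covariances.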

C. Proposed Scheme 

In this subsection, we derive the frequency domain MMSE- 
based soft packet combiner that cancels both CAI and ISI 
jointly over multiple ARQ rounds in the presence of unknown 
CCI. 

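To combine signals corresponding to ARQ rounds $1, \dots, k$,
we use conventional soft parallel interference cancellation
(PIC) (of both multi-round CAI and ISI) and unconditional
MMSE filtering techniques [3]. Therefore, at each turbo
iteration of ARQ round $k$, the MMSE-based soft packet
combiner produces a complex scalar decision $z_{t,i}^{(k)}$ that serves for
computing extrinsic LLR values corresponding to coded and
interleaved bits mapped over symbol $s_{t,i}$. Let $\boldsymbol{\varphi}_{t,i}$ denote the
vector of a priori LLRs of bits corresponding to symbol $s_{t,i}$,
and available at the input of the soft combiner at a particular
turbo iteration. $\sigma_{t,i}^2 \triangleq E\left[|s_{t,i}|^2 \mid \boldsymbol{\varphi}_{t,i}\right] - \left|E\left[s_{t,i} \mid \boldsymbol{\varphi}_{t,i}\right]\right|^2$
denotes the conditional variance of $s_{t,i}$. By invoking either
the orthogonal projection theorem or Lagrangian methods, and
using (21) and (27), soft MMSE-based packet combining at
ARQ round $k$ can be performed in the frequency domain as,

$$\mathbf{z}_f^{(k)} = \boldsymbol{\Gamma}^{(k)}\underline{\mathbf{y}}_f^{(k)} - \boldsymbol{\Omega}^{(k)}\tilde{\mathbf{s}}_f, \tag{29}$$

where $\mathbf{z}_f^{(k)}$ is the DFT of $\mathbf{z}^{(k)} \triangleq \left[z_{1,0}^{(k)}, \dots, z_{N_T,T-1}^{(k)}\right]^{\top} \in \mathbb{C}^{N_T T}$,
$\tilde{\mathbf{s}}_f \in \mathbb{C}^{N_T T}$ denotes the DFT
of the soft symbol vector $\tilde{\mathbf{s}} \triangleq E\left[\mathbf{s} \mid \boldsymbol{\varphi}_{t,i} : \forall (t,i)\right]$, and

$$\begin{aligned}
\boldsymbol{\Gamma}^{(k)} &\triangleq \mathrm{diag}\left\{\underline{\boldsymbol{\Lambda}}_0^{(k)H}\mathbf{B}_0^{(k)^{-1}}, \dots, \underline{\boldsymbol{\Lambda}}_{T-1}^{(k)H}\mathbf{B}_{T-1}^{(k)^{-1}}\right\}, \\
\boldsymbol{\Omega}^{(k)} &\triangleq \mathrm{diag}\left\{\mathbf{C}_0^{(k)}, \dots, \mathbf{C}_{T-1}^{(k)}\right\} - \mathbf{I}_T \otimes \mathrm{diag}\left\{\left(\bar{\mathbf{C}}^{(k)}\right)_{1,1}, \dots, \left(\bar{\mathbf{C}}^{(k)}\right)_{N_T,N_T}\right\},
\end{aligned} \tag{30}$$

$$\mathbf{B}_i^{(k)} \triangleq \underline{\boldsymbol{\Lambda}}_i^{(k)}\,\bar{\boldsymbol{\Sigma}}\,\underline{\boldsymbol{\Lambda}}_i^{(k)H} + \boldsymbol{\Xi}_k \in \mathbb{C}^{k N_R \times k N_R}, \tag{31}$$

$$\mathbf{C}_i^{(k)} \triangleq \underline{\boldsymbol{\Lambda}}_i^{(k)H}\mathbf{B}_i^{(k)^{-1}}\underline{\boldsymbol{\Lambda}}_i^{(k)} \in \mathbb{C}^{N_T \times N_T}.$$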



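Matrices $\bar{\mathbf{C}}^{(k)}$ and $\bar{\boldsymbol{\Sigma}}$ denote time averages of $\mathbf{C}_i^{(k)}$ and
$\boldsymbol{\Sigma}_i \triangleq \mathrm{diag}\left\{\sigma_{1,i}^2, \dots, \sigma_{N_T,i}^2\right\}$,
respectively, as defined in Section I. The input for the soft
demapper can be extracted from $\mathbf{z}_f^{(k)}$ as
$z_{t,i}^{(k)} = \mathbf{e}_{t,i}^{\top}\,\mathbf{U}_{T,N_T}^{H}\,\mathbf{z}_f^{(k)}$,
where $\mathbf{e}_{t,i}$ is the $(iN_T + t)$th vector of the canonical basis. As
can be seen from the forward-backward filtering structure in (29),
the frequency domain MMSE filter explicitly cancels soft
CAI and ISI while it only requires the covariance of unknown
CCI plus noise. Note that both Propositions 1 and 2 are used
to derive (29).

Table I

Summary of Frequency Domain MMSE-Based Turbo Packet Combining in the Presence of CCI

0. Initialization
   Initialize $\{\mathbf{D}_i^{(0)}\}_{i=0}^{T-1}$ and $\tilde{\mathbf{y}}_f^{(0)}$ with $\mathbf{0}_{N_T \times N_T}$ and $\mathbf{0}_{T N_T \times 1}$, respectively.
1. Packet Combining at ARQ round k
   1.1. Compute the DFT of CSI and received signals at round $k$, i.e., $\boldsymbol{\Lambda}_0^{(k)}, \dots, \boldsymbol{\Lambda}_{T-1}^{(k)}$ and $\mathbf{y}_f^{(k)}$, respectively.
   1.2. For each turbo iteration
      1.2.1. Compute the soft symbol vector $\tilde{\mathbf{s}}$ and variances $\sigma_{t,i}^2$, then deduce the DFT $\tilde{\mathbf{s}}_f$ and $\bar{\boldsymbol{\Sigma}}$.
      1.2.2. Estimate the CCI plus noise covariance $\boldsymbol{\Theta}_k$ at round $k$ using (32), then compute $\boldsymbol{\Theta}_k^{-1}$.
      1.2.3. Compute $\{\boldsymbol{\Lambda}_i^{(k)H}\boldsymbol{\Theta}_k^{-1}\boldsymbol{\Lambda}_i^{(k)}\}_{i=0}^{T-1}$ and $\boldsymbol{\Lambda}^{(k)H}\left(\mathbf{I}_T \otimes \boldsymbol{\Theta}_k^{-1}\right)\mathbf{y}_f^{(k)}$.
      1.2.4. Deduce $\{\mathbf{D}_i^{(k)}\}_{i=0}^{T-1}$ and $\tilde{\mathbf{y}}_f^{(k)}$ using recursions (34) and (38), respectively.
      1.2.5. Compute the inverses of $\{\bar{\boldsymbol{\Sigma}}^{-1} + \mathbf{D}_i^{(k)}\}_{i=0}^{T-1}$, then construct $\{\mathbf{C}_i^{(k)}\}_{i=0}^{T-1}$ and $\mathbf{F}^{(k)}$ using (35) and (37), respectively.
      1.2.6. Compute forward and backward parts $\boldsymbol{\Gamma}^{(k)}\underline{\mathbf{y}}_f^{(k)}$ and $\boldsymbol{\Omega}^{(k)}\tilde{\mathbf{s}}_f$ using (36) and (30), respectively.
      1.2.7. Calculate extrinsic LLRs.
      1.2.8. Perform SISO decoding.
      1.2.9. End 1.2.
   1.3. If "frame error" then store $\{\mathbf{D}_i^{(k)}\}_{i=0}^{T-1}$ and $\tilde{\mathbf{y}}_f^{(k)}$ computed at the last turbo iteration, and send "NACK".
   1.4. Otherwise, send "ACK".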

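To obtain estimates of unknown CCI plus noise covariance
matrices $\boldsymbol{\Theta}_1, \dots, \boldsymbol{\Theta}_k$, required by (31), let us consider the
single-round frequency domain communication model (13).
Proposition 3 indicates that the covariance of $\mathbf{w}_f^{(k)}$ is
$\underline{\boldsymbol{\Theta}}_k = \mathbf{I}_T \otimes \boldsymbol{\Theta}_k$. Therefore, with respect to the block diagonal
structure of (13), unknown CCI plus noise covariance $\boldsymbol{\Theta}_k$
can directly be estimated in the frequency domain at each
turbo iteration, with the aid of a priori LLRs, according to
the following average,

$$\hat{\boldsymbol{\Theta}}_k = \frac{1}{T}\sum_{i=0}^{T-1}\left(\mathbf{y}_{f_i}^{(k)} - \boldsymbol{\Lambda}_i^{(k)}\tilde{\mathbf{s}}_{f_i}\right)\left(\mathbf{y}_{f_i}^{(k)} - \boldsymbol{\Lambda}_i^{(k)}\tilde{\mathbf{s}}_{f_i}\right)^{H}, \tag{32}$$

where $\mathbf{y}_{f_i}^{(k)}$ and $\tilde{\mathbf{s}}_{f_i}$ denote the DFTs of $\mathbf{y}^{(k)}$ and $\tilde{\mathbf{s}}$ at frequency bin
$i$, respectively, i.e., $\mathbf{y}_{f_i}^{(k)} = \mathbf{E}_{i,N_R}\mathbf{y}_f^{(k)}$ and $\tilde{\mathbf{s}}_{f_i} = \mathbf{E}_{i,N_T}\tilde{\mathbf{s}}_f$.
Covariance matrices $\boldsymbol{\Theta}_1, \dots, \boldsymbol{\Theta}_{k-1}$ are similarly estimated
at ARQ rounds $1, \dots, k-1$, respectively, and correspond to
estimates obtained at the last turbo iteration. In other words,
when the decoding outcome is erroneous, a NACK message is
fed back to the transmitter, and the unknown CCI plus noise
covariance estimate obtained at the last iteration is saved in the
receiver to help perform packet combining at the next ARQ
round.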
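The averaging idea behind this covariance estimate can be sketched as follows: subtract the reconstructed useful signal from the received frequency domain signal in every bin, then average the outer products of the residuals. This is a minimal illustration with plain Python lists under the assumption that the estimator has the sample-average form described above; the function name `estimate_cci_noise_covariance` and the toy inputs are hypothetical.

```python
def estimate_cci_noise_covariance(y_f, lam, s_f):
    """Average of residual outer products over the T frequency bins.

    y_f[i] : received vector at bin i (length N_R)
    lam[i] : channel frequency response at bin i (N_R x N_T nested list)
    s_f[i] : DFT of the soft symbol vector at bin i (length N_T)
    Returns an N_R x N_R nested list of complex numbers.
    """
    T = len(y_f)
    n_r = len(y_f[0])
    theta = [[0j] * n_r for _ in range(n_r)]
    for i in range(T):
        # Residual = received signal minus reconstructed useful signal.
        resid = [
            y_f[i][r] - sum(lam[i][r][t] * s_f[i][t] for t in range(len(s_f[i])))
            for r in range(n_r)
        ]
        for a in range(n_r):
            for b in range(n_r):
                theta[a][b] += resid[a] * resid[b].conjugate() / T
    return theta


# Toy example with T = 2 bins, N_R = 2, N_T = 1 and known residuals
# r_0 = [1, 0], r_1 = [0, 1j], so the average outer product is 0.5 * I.
lam = [[[1], [2]], [[1j], [1]]]
s_f = [[1], [-1]]
y_f = [[2, 2], [-1j, -1 + 1j]]
theta_hat = estimate_cci_noise_covariance(y_f, lam, s_f)
```

With perfect soft symbols the residual contains only CCI plus noise, which is why the estimate improves as the turbo iterations refine the a priori LLRs.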



D. Implementation Aspects 

We first provide an efficient implementation of the proposed
scheme since turbo combining requires at each turbo iteration
the computation of matrix inverses
$\mathbf{B}_0^{(k)^{-1}}, \dots, \mathbf{B}_{T-1}^{(k)^{-1}} \in \mathbb{C}^{k N_R \times k N_R}$
given by (31). Second, we analyze the computational
complexity and memory requirements of the proposed
implementation algorithm.



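1) An Efficient Implementation Algorithm: The special
structure of the frequency domain ARQ channel matrix (23)
together with the matrix inversion lemma [40] allow us to
express the inverse of $\mathbf{B}_i^{(k)}$ as,

$$\mathbf{B}_i^{(k)^{-1}} = \boldsymbol{\Xi}_k^{-1} - \boldsymbol{\Xi}_k^{-1}\underline{\boldsymbol{\Lambda}}_i^{(k)}\left(\bar{\boldsymbol{\Sigma}}^{-1} + \mathbf{D}_i^{(k)}\right)^{-1}\underline{\boldsymbol{\Lambda}}_i^{(k)H}\boldsymbol{\Xi}_k^{-1}, \tag{33}$$

where $\mathbf{D}_i^{(k)}$ is obtained according to the following recursion,

$$\begin{cases}
\mathbf{D}_i^{(k)} = \mathbf{D}_i^{(k-1)} + \boldsymbol{\Lambda}_i^{(k)H}\boldsymbol{\Theta}_k^{-1}\boldsymbol{\Lambda}_i^{(k)}, \\
\mathbf{D}_i^{(0)} = \mathbf{0}_{N_T \times N_T}.
\end{cases} \tag{34}$$

Therefore, matrices $\mathbf{C}_0^{(k)}, \dots, \mathbf{C}_{T-1}^{(k)}$ are simply computed as,

$$\mathbf{C}_i^{(k)} = \mathbf{D}_i^{(k)} - \mathbf{D}_i^{(k)}\left(\bar{\boldsymbol{\Sigma}}^{-1} + \mathbf{D}_i^{(k)}\right)^{-1}\mathbf{D}_i^{(k)}, \tag{35}$$

while the forward filtering part of (29) is calculated at each
ARQ round $k$ as,

$$\boldsymbol{\Gamma}^{(k)}\underline{\mathbf{y}}_f^{(k)} = \mathbf{F}^{(k)}\tilde{\mathbf{y}}_f^{(k)}, \tag{36}$$

where

$$\mathbf{F}^{(k)} = \mathrm{diag}\left\{\mathbf{I}_{N_T} - \mathbf{D}_0^{(k)}\left(\bar{\boldsymbol{\Sigma}}^{-1} + \mathbf{D}_0^{(k)}\right)^{-1}, \dots, \mathbf{I}_{N_T} - \mathbf{D}_{T-1}^{(k)}\left(\bar{\boldsymbol{\Sigma}}^{-1} + \mathbf{D}_{T-1}^{(k)}\right)^{-1}\right\}, \tag{37}$$

and $\tilde{\mathbf{y}}_f^{(k)}$ is given by the following recursion,

$$\begin{cases}
\tilde{\mathbf{y}}_f^{(k)} = \tilde{\mathbf{y}}_f^{(k-1)} + \boldsymbol{\Lambda}^{(k)H}\left(\mathbf{I}_T \otimes \boldsymbol{\Theta}_k^{-1}\right)\mathbf{y}_f^{(k)}, \\
\tilde{\mathbf{y}}_f^{(0)} = \mathbf{0}_{T N_T \times 1}.
\end{cases} \tag{38}$$

The proposed turbo packet combining algorithm is
summarized in Table I. Note that, during the first iteration of round $k$,
the anti-causal parts in recursions (34) and (38), i.e., $\mathbf{D}_i^{(k-1)}$
and $\tilde{\mathbf{y}}_f^{(k-1)}$, respectively, correspond to the output of these
recursions at the last iteration of previous round $k-1$.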
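The following sketch illustrates, for a single frequency bin and the scalar case $N_T = N_R = 1$, why accumulating two per-round quantities is enough: the recursively computed forward-filter output coincides with the one obtained by stacking two rounds and inverting the $2 \times 2$ matrix directly. It assumes the recursions take the additive form "previous value plus (conjugate channel) times (inverse covariance) times (channel or received sample)"; all numeric values and function names are illustrative.

```python
def recursive_forward(lams, thetas, ys, sigma_bar):
    """Forward-filter output at one bin via round-by-round accumulation."""
    d = 0.0          # scalar counterpart of the accumulated matrix D
    y_tilde = 0j     # scalar counterpart of the accumulated vector y-tilde
    for lam, theta, y in zip(lams, thetas, ys):
        d += abs(lam) ** 2 / theta              # D <- D + lam^H theta^-1 lam
        y_tilde += lam.conjugate() * y / theta  # y~ <- y~ + lam^H theta^-1 y
    f = 1.0 - d / (1.0 / sigma_bar + d)         # I - D (sigma_bar^-1 + D)^-1
    return f * y_tilde


def direct_forward_two_rounds(lams, thetas, ys, sigma_bar):
    """Same output by stacking two rounds and inverting B (2 x 2) directly."""
    (l1, l2), (t1, t2), (y1, y2) = lams, thetas, ys
    # B = sigma_bar * lam lam^H + diag(theta_1, theta_2)
    b11 = sigma_bar * abs(l1) ** 2 + t1
    b12 = sigma_bar * l1 * l2.conjugate()
    b21 = sigma_bar * l2 * l1.conjugate()
    b22 = sigma_bar * abs(l2) ** 2 + t2
    det = b11 * b22 - b12 * b21
    # x = B^-1 y using the closed-form 2 x 2 inverse
    x1 = (b22 * y1 - b12 * y2) / det
    x2 = (-b21 * y1 + b11 * y2) / det
    return l1.conjugate() * x1 + l2.conjugate() * x2   # lam^H B^-1 y


lams = (0.9 + 0.3j, -0.4 + 1.1j)   # channel responses at rounds 1 and 2
thetas = (0.5, 2.0)                # CCI plus noise variances at rounds 1 and 2
ys = (1.2 - 0.7j, 0.3 + 0.9j)      # received samples at rounds 1 and 2
sigma_bar = 0.6                    # time-averaged soft symbol variance

gap = abs(recursive_forward(lams, thetas, ys, sigma_bar)
          - direct_forward_two_rounds(lams, thetas, ys, sigma_bar))
```

The gap is zero up to rounding, and the recursive version never touches a matrix whose size grows with the number of rounds, which is the point of the implementation.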
Table II

Summary of Memory and Arithmetic Additions Required by the Proposed and LLR-Level Combining Schemes, and Relative Cost Evaluation

Combining scheme | Memory | Arithmetic Additions | Relative cost (QPSK) | Relative cost (8-PSK) | Relative cost (16-QAM)
LLR-Level | $T N_T \log_2 \lvert\mathcal{S}\rvert$ | $T N_T N_{it} (K-1) \log_2 \lvert\mathcal{S}\rvert$ | - | - | -
Proposed | $2 T N_T (N_T + 1)$ | $2 T N_T N_{it} (K-1)(N_T + 1)$ | $N_T$ | $\frac{2N_T - 1}{3}$ | $\frac{N_T - 1}{2}$



R 



■Na 



1 



3Rx 



R 



N' 



NiiXNa 



1 



3Tx 



(42) 



2) Computational Complexity and Memory Requirements:
The proposed recursive implementation algorithm avoids
storing received signals and CFRs corresponding to multiple ARQ
rounds. It also prevents the computation of $kN_R \times kN_R$ matrix
inverses. This dramatically reduces the implementation cost
since the complexity order of directly computing $\mathbf{B}_i^{(k)^{-1}}$ is
cubic against $kN_R$, and is greatly increased from round to
round. In the following, we analyze both the complexity and
memory requirements of the proposed scheme, and compare
them with those of the LLR-level combining technique.^4

First, note that in the case of LLR-level packet combining,
frequency domain MMSE equalization is separately performed
for each ARQ round. Therefore, $T$ inversions of $N_R \times N_R$
matrices are required to compute the forward and backward
filters. Since in general it is required to have more receive
than transmit antennas, especially when CCI is present in the
system, an implementation similar to that introduced in the
previous subsection is beneficial because only $T$ inversions
of $N_T \times N_T$ matrices will be required. In this case, the two
variables in recursions (34) and (38) are computed at ARQ
round $k$ as $\mathbf{D}_i^{(k)} = \boldsymbol{\Lambda}_i^{(k)H}\boldsymbol{\Theta}_k^{-1}\boldsymbol{\Lambda}_i^{(k)} \in \mathbb{C}^{N_T \times N_T}$ and
$\tilde{\mathbf{y}}_f^{(k)} = \boldsymbol{\Lambda}^{(k)H}\left(\mathbf{I}_T \otimes \boldsymbol{\Theta}_k^{-1}\right)\mathbf{y}_f^{(k)} \in \mathbb{C}^{T N_T}$, while all the other steps in
Table I remain the same (including the CCI plus noise covariance
estimation procedure in step 1.2.2). Therefore, by letting $N_{it}$
denote the number of turbo iterations at each ARQ round, both
combining algorithms have similar computational complexities
since the proposed scheme and the LLR-level scheme require
at most $C^{+}_{\text{new scheme}} = 2TN_TN_{it}(K-1)(N_T+1)$ and
$C^{+}_{\text{LLR-level}} = TN_TN_{it}(K-1)\log_2|\mathcal{S}|$ arithmetic additions
to perform (34) and (38), and to combine LLRs corresponding
to multiple rounds, respectively.

LLR-level packet combining performs the combination of
extrinsic LLR values generated by frequency domain soft
equalizers at multiple ARQ rounds. Therefore, a storage
capacity of $TN_T\log_2|\mathcal{S}|$ real values is required to store
accumulated LLR values corresponding to all ARQ rounds.

^4 In this paper, LLR-level combining refers to the iterative (turbo) packet
combining and SISO decoding receiver, where transmissions corresponding
to $k$ ARQ rounds are separately turbo equalized using $k$ frequency domain
MMSE soft equalizers. To perform packet combining at each iteration of ARQ
round $k$, extrinsic LLR values generated by the soft MMSE equalizer at round
$k$ and those obtained at the last iteration of previous rounds $1, \dots, k-1$ are
added together, then SISO decoding is performed.



The proposed scheme combines multiple transmissions at the
signal level using signals and CFRs corresponding to all ARQ
rounds, without requiring them to be explicitly stored in the
receiver. This is performed with the aid of the two variables
$D_i^{(k)}$ and $\tilde{y}^{(k)}$ in recursions (34) and (38), respectively. This
translates into a memory size of $2 T N_T (N_T + 1)$ real values.
Therefore, the computational complexity and storage requirements
are less sensitive to the ARQ delay. The technique
requires only a few more additions and a bit more memory
compared to LLR-level combining. Table II summarizes implementation
requirements and reports the relative costs for
some modulation schemes.
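
The addition counts and memory sizes quoted above can be evaluated numerically. The sketch below is only an illustration: the function names and the example values of $T$, $N_T$, the number of turbo iterations and $K$ are hypothetical choices, and the formulas are the upper bounds stated in the text, not measured costs.

```python
import math


def additions_proposed(T, n_t, n_it, K):
    """Bound on additions for the signal-level recursions: 2 T N_T N_it (K-1)(N_T+1)."""
    return 2 * T * n_t * n_it * (K - 1) * (n_t + 1)


def additions_llr_level(T, n_t, n_it, K, constellation_size):
    """Bound on additions for LLR-level combining: T N_T N_it (K-1) log2|S|."""
    return T * n_t * n_it * (K - 1) * math.log2(constellation_size)


def memory_proposed(T, n_t):
    """Real values stored by the proposed scheme: 2 T N_T (N_T + 1)."""
    return 2 * T * n_t * (n_t + 1)


def memory_llr_level(T, n_t, constellation_size):
    """Real values stored by LLR-level combining: T N_T log2|S|."""
    return T * n_t * math.log2(constellation_size)


def relative_cost(n_t, constellation_size):
    """Ratio of proposed to LLR-level cost: 2 (N_T + 1) / log2|S|."""
    return 2 * (n_t + 1) / math.log2(constellation_size)


# Hypothetical setting: T = 258 channel uses, N_T = 2, 5 turbo iterations, K = 3.
T, n_t, n_it, K = 258, 2, 5, 3
for name, size in (("QPSK", 4), ("16-QAM", 16), ("64-QAM", 64)):
    ratio_add = additions_proposed(T, n_t, n_it, K) / additions_llr_level(T, n_t, n_it, K, size)
    ratio_mem = memory_proposed(T, n_t) / memory_llr_level(T, n_t, size)
    print(name, ratio_add, ratio_mem, relative_cost(n_t, size))
```

Both ratios depend only on $N_T$ and the constellation size, not on $K$, $T$ or the number of iterations, because those factors are common to the two schemes.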

IV. Performance Evaluation 
A. Asymptotic Performance Analysis 

In the following, we provide a frame-basis analysis where 
we derive system conditions under which perfect CCI cancel- 
lation holds. We suppose that the interferer CSI is perfectly 
known, and investigate the influence of its channel properties 
on the interference cancellation capability of the proposed 
packet combining scheme in the high SNR regime. 

Theorem 1: We consider a CCI-limited MIMO ARQ system
with $N_T$ transmit and $N_R$ receive antennas, and ARQ
delay $K$. Let $\Theta^{\mathrm{CCI}(k)}$ denote the CCI covariance at ARQ round
$k = 1, \cdots, K$, i.e., the covariance of the global noise at the
receiver is $\Theta_k = \Theta^{\mathrm{CCI}(k)} + \sigma^2 I_{N_R}$, and $\rho_k$ be the rank of $\Theta^{\mathrm{CCI}(k)}$.
We assume perfect LLR feedback from the SISO decoder.
The frequency domain soft MMSE packet combiner provides
perfect CCI suppression for asymptotically high SNR if

$$\sum_{u=1}^{k} \rho_u \le k N_R - N_T. \qquad (39)$$

Proof: See the Appendix. ■
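
The sum-rank condition of Theorem 1 is easy to test for a given sequence of CCI ranks. The following sketch assumes the inequality is non-strict (sum of ranks at most $k N_R - N_T$); the function names and the example rank profiles are hypothetical and only illustrate how the condition evolves from round to round.

```python
def sum_rank_condition_holds(ranks, n_r, n_t):
    """Check sum_{u=1..k} rho_u <= k * N_R - N_T, where k = len(ranks)."""
    k = len(ranks)
    return sum(ranks) <= k * n_r - n_t


def first_round_with_cancellation(ranks, n_r, n_t):
    """Return the first ARQ round (1-based) at which the condition holds, else None."""
    for k in range(1, len(ranks) + 1):
        if sum_rank_condition_holds(ranks[:k], n_r, n_t):
            return k
    return None


# Hypothetical CCI rank profiles over K = 3 rounds with N_R = N_T = 2.
print(first_round_with_cancellation([2, 1, 1], n_r=2, n_t=2))
print(first_round_with_cancellation([2, 2, 2], n_r=2, n_t=2))
```

In the first profile the condition is met only at the third round; in the second the interferer always has full rank and the condition is never met within three rounds.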
^Relative costs refer to the relative number of arithmetic additions $\Delta C$
and memory $\Delta M$ required by the proposed scheme compared to LLR-level
combining. With respect to storage requirements and number of arithmetic
additions in Table II, we have $\Delta C = \Delta M = 2 (N_T + 1) / \log_2 |S|$.

We now proceed to derive an upper bound on $\rho_k$, where we
incorporate the rank of the CCI fading channel. Under the
assumption that CCI symbols satisfy (1), i.e., infinitely deep
interleaving, we get

$$\Theta^{\mathrm{CCI}(k)} = \sum_{l'=0}^{L'-1} H_{l'}^{\mathrm{CCI}(k)} H_{l'}^{\mathrm{CCI}(k)H}. \qquad (40)$$



Let us write each CCI channel matrix as

$$H_{l'}^{\mathrm{CCI}(k)} = R_{N_R}^{1/2} A_{l'}^{\mathrm{CCI}(k)} R_{N_T'}^{1/2}, \qquad (41)$$

where $A_{l'}^{\mathrm{CCI}(k)} \in \mathbb{C}^{N_R \times N_T'}$ characterizes the scattering environment
between the CCI transmitter and receiver [41], and
$R_{N_R}$ and $R_{N_T'}$ are the correlation matrices controlling the
receive and transmit antenna arrays, which are in general
parameterized by correlation coefficients $0 \le \delta_{rx}, \delta_{tx} < 1$ [41]. Note that (41)
corresponds to a general model of correlated fading MIMO
channels, where the scattering radii at the transmitter and receiver
sides are taken into account, and $A_{l'}^{\mathrm{CCI}(k)}$ is not necessarily
a full rank matrix, i.e., $\mathrm{rank}(A_{l'}^{\mathrm{CCI}(k)}) \le \min(N_R, N_T')$
[41]. Noting that $R_{N_T'}$ and $R_{N_R}$ are full rank matrices, and
with respect to the fact that CCI tap channel matrices are
independent, and using (40) and (41), we get



$$\begin{aligned}
\rho_k &\le \min\Big\{ N_R,\ \mathrm{rank}\Big( \sum_{l'=0}^{L'-1} H_{l'}^{\mathrm{CCI}(k)} H_{l'}^{\mathrm{CCI}(k)H} \Big) \Big\} \\
&= \min\Big\{ N_R,\ \mathrm{rank}\Big( \sum_{l'=0}^{L'-1} A_{l'}^{\mathrm{CCI}(k)} R_{N_T'} A_{l'}^{\mathrm{CCI}(k)H} \Big) \Big\} \\
&\le \min\Big\{ N_R,\ \sum_{l'=0}^{L'-1} \mathrm{rank}\big( A_{l'}^{\mathrm{CCI}(k)} \big) \Big\}. \qquad (43)
\end{aligned}$$

A closer look at Theorem 1 and upper bound (43) provides
interesting system interpretations.

• Impact of CCI Fading Channel: First, note that the CCI
cancellation capability of the frequency domain MMSE
packet combiner is related to the CCI channel rank.
When the interferer has a rank-deficient channel matrix
at a certain ARQ round, interference can be completely
removed (at subsequent rounds) if the sum-rank condition
in Theorem 1 is satisfied. In practice, the channel rank
can dramatically drop in the case of the so-called pinhole
channel, where the transmitter and receiver are largely
separated and are surrounded by multiple scatterers [41].
In this scenario, the channel can even prevent multipath
from building up since the thin air pipe connecting transmitter
and receiver scatterers is very long. For instance,
in a system with $N_R = 3$ receive and $N_T = 2$ transmit
antennas, and an unknown interferer who is experiencing
one path ($L' = 1$) channel realizations with rank equal
to two, CCI can be removed at the second ARQ round
because the sum-rank condition (39) holds for $k \ge 2$.

• Impact of the Number of Transmit Antennas and
ARQ Delay: Condition (39) suggests how, for a given
CCI channel profile, the number of transmit antennas $N_T$
and ARQ rounds $K$ are chosen to achieve perfect CCI
cancellation. For instance, if transmission is corrupted
by CCI with quasi-static channel rank*, and if the
ARQ delay allowed by the upper layer is $K$, then only
$N_T \le K (N_R - \rho_0)$ transmit antennas can be allocated
to the user of interest to achieve interference suppression
at the latest at ARQ round $K$, where $\rho_0$ is the rank of
$\Theta^{\mathrm{CCI}(k)}$, i.e., $\rho_k = \rho_0\ \forall k$. Increasing the ARQ delay will
relax the condition on the number of transmit antennas
and therefore allow for an increase in the diversity
and/or multiplexing gains depending on the diversity-multiplexing-delay
trade-off operating point [23]. Note
that when $N_T' \ll N_T$, the CCI channel rank dramatically
drops, and therefore CCI suppression is achieved even
when a short ARQ delay $K$ is required.

• Interaction with the Scheduling Mechanism: In the
case of opportunistic communications, interference with
co-channel users who have high channel ranks can be
prevented. For instance, when a retransmission is required
on the reverse link, the base station (BS) can choose
the timing of the next ARQ round in such a way that
transmission occurs simultaneously with that of a user
with low channel rank. This is feasible since the BS
has complete knowledge about user CSIs in the reverse
link. The same scheduling mechanism can be used in the
forward link if all users provide the BS with feedback
information about their channel ranks. When the system
suffers from CCI caused by neighboring cells, the sum-rank
condition (39) can be achieved by simply increasing
the number of ARQ rounds because the CCI channel rank
tends to be constant over time.
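
For an interferer whose rank stays constant over the ARQ rounds, the sum-rank condition reduces to a simple design rule relating the number of transmit antennas, the number of receive antennas and the ARQ delay. The sketch below assumes the non-strict form of the condition, which is the reading under which cancellation at the second round is obtained for three receive antennas, two transmit antennas and a rank-two interferer; the function names are hypothetical.

```python
def max_transmit_antennas(K, n_r, rho_0):
    """Largest N_T satisfying N_T <= K * (N_R - rho_0); 0 if rho_0 >= N_R."""
    return max(K * (n_r - rho_0), 0)


def min_arq_rounds(n_t, n_r, rho_0):
    """Smallest k with k * rho_0 <= k * N_R - N_T, or None if rho_0 >= N_R."""
    if rho_0 >= n_r:
        return None  # a full-rank interferer never satisfies the condition
    free_dims_per_round = n_r - rho_0
    return -(-n_t // free_dims_per_round)  # integer ceiling of N_T / (N_R - rho_0)


# N_R = 3, N_T = 2, interferer rank 2: cancellation from the second round on.
print(min_arq_rounds(n_t=2, n_r=3, rho_0=2))
# With K = 3 rounds, N_R = 2 and a rank-one interferer, up to 3 transmit antennas fit.
print(max_transmit_antennas(K=3, n_r=2, rho_0=1))
```

Each additional ARQ round contributes $N_R - \rho_0$ interference-free dimensions, which is why a longer ARQ delay relaxes the constraint on $N_T$.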

B. Numerical Results 



In this subsection, we provide block error rate (BLER) 
performance results for the proposed combining technique. 
Our focus is to demonstrate the superior performance of the 
introduced scheme compared to LLR-level combining. We 
also evaluate BLER performance for scenarios where the 
interferer has rank deficient channel matrices to corroborate 
the theoretical analysis in Subsection IV-A.

In all simulations, we consider a BICM scheme where
the encoder is a rate-1/2 convolutional code with polynomial
generators $(35, 23)_8$, and the modulation scheme is quadrature
phase shift keying (QPSK). The length of the code bit frame
is 1032 bits including tails. The ARQ delay is $K = 3$,
and the $E_b/N_0$ ratio appearing in all figures is the SNR
per useful bit per receive antenna. We consider an $L = 2$
path MIMO-ISI channel profile where $\sigma_0^2 = \sigma_1^2 = 1/2$. In
practical wireless systems, the wireless channel may have
more than two paths due to severe frequency selective fading.
In this paper, we restrict ourselves to $L = 2$ for the sake
of simulation simplicity. Performance in the case of severe
frequency selective fading channels can be found in [38]. We
use both the matched filter bound (MFB) per ARQ round
and the outage probability [26] of the CCI-free MIMO-ISI
channel as absolute performance bounds to evaluate the CCI
cancellation capability and diversity order achieved by the
proposed combining scheme. The number of turbo iterations
is set to five and the Max-Log-MAP version of the maximum
a posteriori (MAP) algorithm is used for SISO decoding.

*In this case, CCI with quasi-static channel rank refers to an interferer
whose channel rank is constant over multiple ARQ rounds.

Figure 3. BLER performance for CC $(35, 23)_8$, QPSK, $N_T = N_R = 2$,
$L = L' = 2$ equal energy paths, and SIR = 3 dB.

Figure 4. BLER performance for CC $(35, 23)_8$, QPSK, $N_T = N_R = 2$,
$L = L' = 2$ equal energy paths, and SIR = 5 dB.

Figure 5. BLER performance for CC $(35, 23)_8$, QPSK, $N_T = 4$, $N_R = 2$,
$L = L' = 2$ equal energy paths, and SIR = 5 dB.

Figure 6. BLER performance for CC $(35, 23)_8$, QPSK, $N_T = N_R = 2$,
$L = 2$ equal energy paths, $N_T' = 1$, $L' = 1$, and $\rho_k = 1$, $k = 1, \cdots, K$,
SIR = 3 dB.
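
To make the code description concrete, the sketch below encodes a bit sequence with a rate-1/2 feedforward convolutional code whose generators are 35 and 23 in octal. Several details are assumptions rather than facts from the paper: the most significant bit of each generator is taken to act on the newest input bit, the trellis is terminated with four zero tail bits, and the function name is hypothetical. Under these assumptions, a 1032-bit coded frame corresponds to 512 information bits.

```python
GENERATORS = (0o35, 0o23)   # octal 35 -> binary 11101, octal 23 -> binary 10011
CONSTRAINT_LENGTH = 5       # 5 taps, i.e. 4 memory elements and 16 trellis states


def conv_encode(info_bits, generators=GENERATORS, constraint=CONSTRAINT_LENGTH):
    """Rate-1/len(generators) feedforward encoder with zero-tail termination."""
    state = 0                                     # the (constraint - 1) most recent past bits
    coded = []
    for bit in list(info_bits) + [0] * (constraint - 1):
        register = (bit << (constraint - 1)) | state   # newest bit sits in the MSB
        for g in generators:
            coded.append(bin(register & g).count("1") % 2)  # parity of the tapped bits
        state = register >> 1                     # shift: drop the oldest bit
    return coded


impulse_response = conv_encode([1])
print(impulse_response)
print(len(conv_encode([0] * 512)))
```

The impulse response interleaves the two generator sequences 11101 and 10011, which is a quick sanity check of the tap ordering assumed here.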

We first investigate performance for scenarios where the
user of interest and the interferer have the same number of
transmit antennas ($N_T = N_T'$) and identical channel profiles,
i.e., $L = L'$, equal power taps, and CCI fading channel coefficients
are i.i.d. In Fig. 3, we compare the BLER performance
of the proposed scheme with that of LLR-level combining for
a ST-BICM code with rate $R = 2$, i.e., $N_T = 2$. The number
of receive antennas is $N_R = 2$, and SIR = 3 dB. We observe
that the proposed scheme significantly outperforms LLR-level
combining. The performance gap at ARQ round $k = 3$ is about
1 dB for BLER $\le 10^{-?}$. Note that both combining schemes
fail to perfectly cancel CCI since performance curves tend to
saturate for high $E_b/N_0$ values. Fig. 4 reports performance
of both techniques when SIR is increased to SIR = 5 dB.
In this case, the performance gap between the two schemes
is reduced. The CCI cancellation capability is also improved,
as can be seen from the steeper slopes of BLER curves. In
Fig. 5, we evaluate the performance for a high rate ST-BICM
code where $R = 4$, i.e., $N_T = 4$. Only $N_R = 2$ receive
antennas are considered, and SIR = 5 dB. The proposed
scheme dramatically outperforms LLR-level combining, i.e.,
the performance gap at ARQ round $k = 3$ is about 4 dB
at $7 \times 10^{-?}$ BLER. The proposed scheme also offers higher
cancellation capability and diversity order than LLR-level
combining.
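
The ST-BICM rates quoted for the simulated configurations follow from the code rate, the constellation size and the number of transmit antennas. The sketch below assumes that $R$ counts information bits per channel use and that the whole coded frame is mapped onto the transmit antennas without any padding; both points, and the function names, are assumptions made for illustration.

```python
from fractions import Fraction


def bits_per_symbol(constellation_size):
    """log2 of a power-of-two constellation size, computed with integers."""
    if constellation_size < 2 or constellation_size & (constellation_size - 1):
        raise ValueError("constellation size must be a power of two")
    return constellation_size.bit_length() - 1


def st_bicm_rate(n_t, code_rate=Fraction(1, 2), constellation_size=4):
    """Information bits per channel use: N_T * code rate * log2|S|."""
    return n_t * code_rate * bits_per_symbol(constellation_size)


def channel_uses_per_frame(coded_bits, n_t, constellation_size=4):
    """Channel uses needed to send one coded frame over N_T antennas."""
    bits_per_use = n_t * bits_per_symbol(constellation_size)
    if coded_bits % bits_per_use:
        raise ValueError("frame length is not a multiple of the bits per channel use")
    return coded_bits // bits_per_use


for n_t in (2, 4):
    print(n_t, st_bicm_rate(n_t), channel_uses_per_frame(1032, n_t))
```

With the rate-1/2 code and QPSK, this gives $R = 2$ for two transmit antennas and $R = 4$ for four, in line with the settings of the figures.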

We now turn to scenarios where the interferer has a rank-deficient
uncorrelated MIMO channel, i.e., $\mathrm{rank}(A_{l'}^{\mathrm{CCI}(k)}) \le \min(N_T', N_R)\ \forall l'$,
$\delta_{tx} = \delta_{rx} = 0$, and assume that the rank
is constant over all ARQ rounds. In Fig. 6, we report the BLER
performance of the proposed scheme for a CCI-limited MIMO
system with settings similar to Fig. 3, i.e., $N_T = N_R = 2$,
and SIR = 3 dB. The interferer experiences flat fading, i.e.,
$L' = 1$, and only has $N_T' = 1$ transmit antenna. Therefore,
with respect to (43), $\rho_k = 1\ \forall k$. Note that in this interference
scenario, the perfect CCI cancellation condition (39) holds for
$k \ge 2$. We observe that both the CCI cancellation capability
and the diversity order of the proposed scheme are improved.
The performance gain with respect to the case of $N_T' = 2$ and
$L' = 2$ is about 1.5 dB at $3 \times 10^{-?}$ BLER and round $k = 3$,
and the slope of the BLER curve at round $k = 3$ is similar
to that of the MFB curve. Fig. 7 compares the performance
of the proposed scheme for two scenarios with heavy CCI,
i.e., SIR = 1 dB. The ST-BICM code has rate $R = 4$, i.e.,
$N_T = 4$, and the number of receive antennas is set to $N_R = 4$.
In the first scenario (Scenario 1), the interferer has $N_T' = 4$
transmit antennas, $L' = 2$ equal power taps, and i.i.d. fading
coefficients, while in the second scenario (Scenario 2), $N_T' = 2$,
$L' = 1$, and the CCI channel rank is equal to two. Therefore,
$\rho_k = 2\ \forall k$, and condition (39) holds for $k \ge 2$. It is clear that
in the second scenario, better CCI cancellation capability is
achieved for $k \ge 2$. For instance, the performance gap for
$k = 3$ is more than 2 dB at $2 \times 10^{-?}$ BLER. Also, the diversity
order of the CCI-free MIMO-ISI channel is almost achieved.

[Figure 7 plot: BLER versus $E_b/N_0$ per Rx antenna (dB) for CC $(35, 23)_8$, QPSK, $N_T = 4$, $N_R = 4$, SIR = 1 dB; curves for Scenario 1 and Scenario 2 at $k = 1, 2, 3$, the MFB at $k = 1, 2, 3$, and the outage probability for $R = 4$, $K = 3$.]

Figure 7. BLER performance for CC $(35, 23)_8$, QPSK, $N_T = N_R = 4$,
$L = 2$ equal energy paths, and SIR = 1 dB. Scenario 1: $N_T' = 4$, $L' = 2$,
Scenario 2: $N_T' = 2$, $L' = 1$, and $\rho_k = 2$, $k = 1, \cdots, K$.

V. Conclusion 

In this paper, we investigated efficient iterative turbo packet
combining for broadband ST-BICM transmission with hybrid
ARQ over CCI-limited MIMO-ISI channels. We have introduced
a frequency domain turbo combining scheme where
signals and CFRs corresponding to all ARQ rounds are combined
in an MMSE fashion to decode the data packet at each
round. The covariance of the overall (over all ARQ rounds)
CCI plus noise required by the frequency domain MMSE
soft packet combiner is constructed by separately computing
the covariance related to each round. A direct implementation
of the proposed technique has a complexity that is cubic in
the product of the number of receive antennas and the ARQ
delay. This limitation is overcome by an optimized recursive
implementation algorithm whose complexity is cubic only in
the number of transmit antennas. We evaluated the computational
load and memory requirements, and found that the introduced
recursive technique requires only a few more arithmetic additions
and slightly more memory than conventional LLR-level combining
schemes. We analyzed the effect of the CCI channel rank on
performance. Interestingly, under a sum-rank condition, the
frequency domain MMSE soft packet combiner can completely
remove CCI for asymptotically high SNR. Finally, we provided
simulation results showing that the proposed technique achieves
BLER performance superior to LLR-level combining, and
offers high CCI cancellation capability and diversity order for
many interference scenarios.

Appendix 

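Proof of Theorem 1

Under the assumption of perfect LLR feedback from the
SISO decoder, the frequency domain soft packet combiner
output (29), at ARQ round $k$, can be expressed as

[Equation (44): combiner output under perfect LLR feedback, written in terms of the gain matrix $A$ and the filtered disturbance $x^{(k)}$.]

where $A$ is the diagonal matrix of frequency domain symbol
gains,

$$A = \mathrm{diag}\Big\{ \big(G_0^{(k)}\big)_{1,1}, \cdots, \big(G_0^{(k)}\big)_{N_T,N_T}, \cdots, \big(G_{T-1}^{(k)}\big)_{1,1}, \cdots, \big(G_{T-1}^{(k)}\big)_{N_T,N_T} \Big\}, \qquad (45)$$

with $G_i^{(k)} = \Lambda_i^{(k)H} \Xi_k^{-1} \Lambda_i^{(k)}$, and $x^{(k)}$ is the filtered CCI plus
thermal noise at the output of the packet combining filter. Its
covariance matrix is

$$G^{(k)} = \mathrm{diag}\Big\{ G_0^{(k)}, \cdots, G_{T-1}^{(k)} \Big\}. \qquad (46)$$

Now, let us examine the structure of matrix $G_i^{(k)}$ for asymptotically
high SNR, i.e., $\sigma^2 \to 0$.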

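Let $\Pi_1 \Pi_1^H, \cdots, \Pi_k \Pi_k^H$ be the low-rank decompositions of
matrices $\Theta^{\mathrm{CCI}(1)}, \cdots, \Theta^{\mathrm{CCI}(k)}$, where $\Pi_1 \in \mathbb{C}^{N_R \times \rho_1}, \cdots, \Pi_k \in \mathbb{C}^{N_R \times \rho_k}$.
For the sake of notation simplicity, we write
$\sum_{u=1}^{k} \rho_u = \rho$. It follows that the rank of $\Pi =
\mathrm{diag}\{\Pi_1, \cdots, \Pi_k\}$ is $\rho$, and $\Xi_k = \Pi \Pi^H + \sigma^2 I_{kN_R}$ is
a square invertible matrix. Therefore, it has an eigenvalue
decomposition (E.V.D) that can be expressed as,

$$\Xi_k = \begin{bmatrix} P_\rho & P_{kN_R-\rho} \end{bmatrix}
\begin{bmatrix} \Upsilon + \sigma^2 I_\rho & 0 \\ 0 & \sigma^2 I_{kN_R-\rho} \end{bmatrix}
\begin{bmatrix} P_\rho^H \\ P_{kN_R-\rho}^H \end{bmatrix}, \qquad (47)$$

where $P = [P_\rho \ \ P_{kN_R-\rho}]$ satisfies $P P^H = P^H P = I_{kN_R}$ since $\Xi_k$ is symmetric. This
condition yields the following set of equalities,

$$P_\rho^H P_\rho = I_\rho, \qquad (48a)$$

$$P_{kN_R-\rho}^H P_{kN_R-\rho} = I_{kN_R-\rho}, \qquad (48b)$$

$$P_\rho^H P_{kN_R-\rho} = 0, \qquad (48c)$$

$$P_\rho P_\rho^H + P_{kN_R-\rho} P_{kN_R-\rho}^H = I_{kN_R}. \qquad (48d)$$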

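Therefore, a Taylor expansion of $\Xi_k^{-1}$ when $\sigma^2 \to 0$ is given
as,

$$\Xi_k^{-1} = P_\rho \Upsilon^{-1} P_\rho^H + \sigma^{-2} P_{kN_R-\rho} P_{kN_R-\rho}^H + O(\sigma^2). \qquad (49)$$

Note that $\Upsilon$ does not have any null diagonal element, i.e., $\Upsilon$ is
invertible. Indeed, multiplying the left and right sides of (47)
by $P_\rho^H$ and $P_\rho$, respectively, and with respect to (48), we get
$P_\rho^H \Pi \Pi^H P_\rho = \Upsilon$. By noting that $P_\rho^H \Pi$ is $\rho \times \rho$ and has rank
equal to $\rho$, it follows that $\Upsilon^{-1} = (\Pi^H P_\rho)^{-1} (P_\rho^H \Pi)^{-1}$.
Therefore, when $\sigma^2 \to 0$, we have,

$$G_i^{(k)} = \Lambda_i^{(k)H} P_\rho \big( \Upsilon^{-1} - \sigma^{-2} I_\rho \big) P_\rho^H \Lambda_i^{(k)} + \sigma^{-2} \Lambda_i^{(k)H} \Lambda_i^{(k)} + O(\sigma^2). \qquad (50)$$

Since the time domain channel coefficients are i.i.d., it follows
that the $kN_R \times N_T$ matrix $\Lambda_i^{(k)}$ has full-column rank unless
all fading coefficients are equal to zero. If $\rho + N_T \le kN_R$, i.e.,
$\rho \le kN_R - N_T$, then the $\rho$ column vectors of $P_\rho$
are in the kernel of $\Lambda_i^{(k)H}$, i.e., $\Lambda_i^{(k)H} P_\rho =
0_{N_T \times \rho}$. It follows that, when $\sigma^2 \to 0$,

$$G_i^{(k)} = \sigma^{-2} \Lambda_i^{(k)H} \Lambda_i^{(k)} + O(\sigma^2). \qquad (51)$$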
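
The passage from the eigenvalue decomposition to the high-SNR form of $G_i^{(k)}$ can be spelled out step by step. The derivation below is a sketch, not the authors' own text: it assumes that the decomposition (47) of $\Xi_k = \Pi \Pi^H + \sigma^2 I_{kN_R}$ has eigenvalues $\Upsilon + \sigma^2 I_\rho$ on the range of $\Pi$ and $\sigma^2$ on its orthogonal complement, and it writes $\Lambda$ for $\Lambda_i^{(k)}$ and $P_\perp$ for $P_{kN_R-\rho}$ to shorten the notation.

```latex
\begin{align*}
\Xi_k^{-1}
  &= P_\rho \left( \Upsilon + \sigma^2 I_\rho \right)^{-1} P_\rho^H
     + \sigma^{-2} P_\perp P_\perp^H
  && \text{(inverse of a unitary E.V.D.)} \\
\left( \Upsilon + \sigma^2 I_\rho \right)^{-1}
  &= \Upsilon^{-1} - \sigma^2 \Upsilon^{-2} + O(\sigma^4)
  && \text{(Neumann series, $\Upsilon$ invertible)} \\
P_\perp P_\perp^H
  &= I_{kN_R} - P_\rho P_\rho^H
  && \text{(completeness of the eigenvectors)} \\
\Lambda^H \Xi_k^{-1} \Lambda
  &= \Lambda^H P_\rho \left( \Upsilon^{-1} - \sigma^{-2} I_\rho \right) P_\rho^H \Lambda
     + \sigma^{-2} \Lambda^H \Lambda + O(\sigma^2) \\
\Lambda^H P_\rho = 0
  \;&\Longrightarrow\;
  \Lambda^H \Xi_k^{-1} \Lambda = \sigma^{-2} \Lambda^H \Lambda + O(\sigma^2)
\end{align*}
```

The last line shows why the kernel condition matters: once $\Lambda^H P_\rho$ vanishes, the interference subspace no longer contributes and only the thermal-noise term $\sigma^{-2} \Lambda^H \Lambda$ remains.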
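Therefore, when SNR $\to \infty$, we get

$$\mathrm{SINR} = \frac{1}{\sigma^2 T} \sum_{i=0}^{T-1} \mathrm{tr}\big\{ \Lambda_i^{(k)H} \Lambda_i^{(k)} \big\} + O(\sigma^2)
= \underbrace{\frac{1}{\sigma^2} \sum_{l=0}^{L-1} \sum_{u=1}^{k} \mathrm{tr}\big\{ H_l^{(u)} H_l^{(u)H} \big\}}_{\mathrm{SNR}_{\mathrm{MF}}} + O(\sigma^2), \qquad (52)$$

where $\mathrm{SNR}_{\mathrm{MF}}$ corresponds to the instantaneous matched filter
(MF) SNR in the case of a $k$-round CCI-free MIMO-ISI ARQ
channel. ■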